Inject a latent vector into network:
Concatenate or add it to encoder layer [2]
Concatenate or add it to the bottleneck [3]
Add to decoder layers using AdaIn [4] (without skip connection), [5] (with skip connection)
Inject a latent map into network:
For concatenation or addition, spatially stack latent codes [9]
Reference
[1] StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation
[2] Toward Multimodal Image-to-Image Translation
[3] Zheng, Chuanxia, Tat-Jen Cham, and Jianfei Cai. “Pluralistic image completion.” CVPR, 2019.
[4] Karras, Tero, Samuli Laine, and Timo Aila. “A style-based generator architecture for generative adversarial networks.” CVPR, 2019.
[5] High-Resolution Daytime Translation Without Domain Labels
[6] Antoniou, Antreas, Amos Storkey, and Harrison Edwards. “Data augmentation generative adversarial networks.” arXiv preprint arXiv:1711.04340 (2017).
[7] Tamar Rott Shaham, Tali Dekel, Tomer Michaeli, “SinGAN: Learning a Generative Model from a Single Natural Image”, ICCV2019
[8] Lee, Hsin-Ying, et al. “Diverse image-to-image translation via disentangled representations.” ECCV, 2018.
[9] Yazeed Alharbi, Peter Wonka: Disentangled Image Generation Through Structured Noise Injection. CVPR, 2020.
[10] Park, Taesung, et al. “Semantic image synthesis with spatially-adaptive normalization.” CVPR, 2019.
[11] Sushko, Vadim, et al. “You only need adversarial supervision for semantic image synthesis.” ICLR, 2021